Claims: 



1 . A method for balancing unicast or multicast flows in a multistage non- 
blocking fabric, wherein the fabric comprises at least one internal switching element 
(SE) stage, wherein the stage has / internal switching elements, wherein each internal 
switching element is associated with a unique numerical identifier, and wherein the 
fabric comprises an input SE stage and an output SE stage, wherein the method 
comprises: 

(a) grouping input ports into input sets whereby each input set consists of 
input ports that transmit through the same input SE, and wherein the input sets are 
divided into input subsets, and 

(b) grouping output ports into output sets whereby each output set consists 
of output ports that receive cells through the same output SE, and wherein the output 
sets are divided into output subsets, 

(c) receiving cells into the fabric wherein 

if a cell is a unicast cell, then the cell is associated with an input subset i and 
associated with an output subset j based on the input port and the output port 
of the cell, else 

if a cell is a multicast cell, then the cell is associated with an input subset i and 
associated with multiple output subsets {/} based on the input port and the 
multiple output ports of the cell, 

(d) assigning each cell to a flow, wherein 

(i) if the cells are unicast cells, then the cells which are associated 
with the same input subset and associated with the same output subset are 
assigned to the same flow, else 

(ii) if the cells are multicast cells, then the cells which are 
associated with the same input subset and associated with the output subsets of 
the same output sets are assigned to the same flow, and 

(e) transmitting flows through the internal SE stage wherein cells of a 
particular flow are distributed among the internal switching elements, wherein the 
quantity of the cells of each particular flow transmitted through each internal SE 
differs by at most A, wherein h is positive, 
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wherein the number of subsets of at least one input set or at least one output set is less 
than n, wherein n is the number of ports of that input SE or of that output SE, and 
wherein N is the total number of input ports and output ports, and wherein is the 
maximum number of flows whose cells pass any given link, and wherein n, N 9 Nf 9 h, i, 
j and / are natural numbers, wherein the flow in the fabric is balanced. 

2. The method according to Claim 1 wherein h is one. 

3. The method according to Claim 2 wherein step (e) comprises the sub-steps of: 

(a) associating a unique counter with a flow, wherein a counter is 
designated as Cy, wherein i is the numerical identifier of an associated input subset and 
j is the numerical identifier of an associated output subset; 

(b) initializing the counter with a number less than or equal to /; 

(c) transmitting a cell from the flow through the internal switching 
element associated with a numerical identifier which is equal to the numerical value 
of the counter and 

(d) changing the numerical value of the counter by decrementing or 
incrementing the counter modulus /, and 

(e) stopping if there are no more cells from the flow, otherwise go to step 

(c), 

wherein the sub-steps are performed for each flow. 

4. The method according to Claim 2 wherein at least one of the input ports 
belongs to at least two input subsets, or wherein at least one of the output ports 
belongs to at least two output subsets, or combinations thereof. 

5. The method according to Claim 3 further comprising: 
grouping cell time slots into frames of length F 9 

wherein the counter of each flow is set at the beginning of each frame, and wherein 
the counter is set to Cij=(i+j) mod /. 

6. The method according to Claim 2 further comprising: 



43 



grouping cell time slots into frames of length F, 

allowing in each frame input port (i) to transmit up to ay cells or high-priority cells to 
output port 0), and wherein: 

^a lk <SF-N f , Z"»^SF-N f 

k k 

where S is the switching fabric speedup. 

7. The method according to Claim 6 wherein at each stage only cells that have 
arrived in the same frame are transmitted to the next stage, wherein F=D/3T C or 
F=D/4T C if cells are reordered at the outputs, wherein D is the maximum tolerable 
delay and T c is cell time slot duration. 

8. The method according to Claim 6 wherein: 

S = l + — 
F 

and: ^a ik < F 9 ^ a h r < F , whereby the utilization of the fabric is maximized. 



9. The method according to Claim 8 wherein at each stage only cells that have 
arrived in the same frame are transmitted to the next stage, wherein F=D/3T C or 
F=D/4T C if cells are reordered at the outputs, wherein D is the maximum tolerable 
delay and T c is cell time slot duration. 

10. The method according to Claim 2 wherein sets are divided in a way that 

N f <(S-U)D/T, 

where S is switching fabric speedup, U is targeted utilization of the switching fabric, 
D is the maximum tolerable delay and T c is cell time slot duration. 

1 1 . The method according to Claim 2 further comprising: 
grouping cell time slots into frames of length F 9 

allowing in each frame input port (i) to transmit ay cells or high-priority cells to 
output port (/), and wherein the number of flows sourced by an input SE or bound for 
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an output SE that are balanced starting from different internal SE differ by at most 
one, wherein: 

N, 



SF f - 
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where S is the switching fabric speedup. 
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12. The method according to Claim 1 1 wherein at each stage only cells that have 
arrived in the same frame are transmitted to the next stage, wherein F=D/3T C or 
F=D/4T C if cells are reordered at the outputs, wherein D is the maximum tolerable 
delay and T c is cell time slot duration. 

13. The method according to Claim 1 1 wherein: 




and wherein : 



whereby utilization of the fabric is maximized. 
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14. The method according to Claim 13 wherein at each stage only cells that have 
arrived in the same frame are transmitted to the next stage, wherein F=D/3T C or 
F=D/4T C if cells are reordered at the outputs, wherein D is the maximum tolerable 
delay and T c is cell time slot duration. 



15. The method according to Claim 1 1 wherein the counter of each flow is set at 
the beginning of each frame, and wherein the counter is set to Cy=(i+j) mod /, wherein 
i is the numerical identifier of an associated input subset and j is the numerical 
identifier of an associated output subset, comprising the following steps: 
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(a) transmitting a cell from the flow through the internal switching 
element associated with a numerical identifier which is equal to the numerical value 
of the counter of this flow; and 

(b) changing the numerical value of the counter by decrementing or 
incrementing the counter modulus /, and 

(c) stopping if there are no more cells from the flow, otherwise go to step 

(a), 

wherein the sub-steps are performed for each flow. 

16. The method according to Claim 2 wherein the numbers of flows sourced by an 
input SE or bound for an output SE that are balanced starting from different internal 
SEs differ by at most 1, wherein the subsets are grouped so that Nf fulfills: 



N f < 



2(S-U)F U>- 



2U 2 

where S is the switching fabric speedup, U is targeted utilization of the switching 
fabric, D is the maximum tolerable delay and T c is cell time slot duration. 



17. An article of manufacture for balancing unicast or multicast flows in a 
multistage non-blocking fabric, wherein the fabric comprises at least one internal 
switching element (SE) stage, wherein the stage has / internal switching elements, 
wherein each internal switching element is associated with a unique numerical 
identifier, and wherein the fabric comprises an input SE stage and an output SE stage, 
wherein the article comprises: 

a machine readable medium containing one or more programs which when 
executed implement the steps of: 

(a) dividing input ports into input sets whereby each input set consists of 
input ports that transmit through the same input SE, and wherein the input sets are 
further divided into input subsets, and 

(b) dividing output ports into output sets whereby each output set consists 
of output ports that receive cells through the same output SE, and wherein the output 
sets are further divided into output subsets, 
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(c) assigning each cell received into the fabric to a flow comprising: 

(i) if a cell is a unicast cell, then associating the cell received into 
the fabric with an input subset and with an output subset based on the 
input port i and the output port j of the cell, wherein cells which are 
associated with the same input subset and associated with the same 
output subset are assigned to the same flow, else 

(ii) if a cell is a multicast cell, then the cell is associated with an 
input subset and associated with multiple output subsets based on the 
input port / and the multiple output ports {/} of the cell, wherein cells 
which are associated with the same input subset and associated with 
the output subsets of the same output sets are assigned to the same 
flow, and 

(d) transmitting flows through the internal SE stage wherein cells of a 
particular flow are distributed among the internal switching elements, wherein the 
quantity of the cells of each particular flow transmitted at each internal SE differs by 
at most h, wherein h is positive, 

wherein the number of subsets of at least one input set or at least one output set is less 
than n, wherein n is the number of ports of that input SE or of that output SE, and 
20 wherein N is the total number of input ports and output ports, and wherein N/, is the 

maximum number of flows whose cells pass any given link, and wherein n 9 N 9 N f , h, i, 
j and / are natural numbers. 

18. The article according to Claim 16 wherein h is one. 
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19. The article according to Claim 18 wherein (d) comprises 

a machine readable medium containing one or more programs which when executed 
implement the steps of: 

(a) associating a unique counter with a flow, wherein a counter is 
designated as c,y, wherein i is the numerical identifier of an associated input subset and 
j is the numerical identifier of an associated output subset; 

(b) initializing the counter with a number less than or equal to /; 
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(c) transmitting a cell from the flow through the internal switching 
element associated with a numerical identifier which is equal to the numerical value 
of the counter; and 

(d) changing the numerical value of the counter by decrementing or 
incrementing the counter modulus /, and 

(e) stopping if there are no more cells from the flow, otherwise go to step 

(c), 

wherein the sub-steps are performed for each flow. 

20. An apparatus for balancing unicast or multicast flows in a multistage non- 
blocking fabric, wherein the fabric comprises at least one internal switching element 
(SE) stage, wherein the stage has / internal switching elements, wherein each internal 
switching element is associated with a unique numerical identifier, and wherein the 
fabric comprises an input SE stage and an output SE stage, the apparatus comprises: 
a flow control device configured to: 

(a) divide input ports into input sets whereby each input set consists of 
input ports that transmit through the same input SE, and wherein the input sets are 
further divided into input subsets, and 

(b) divide output ports into output sets whereby each output set consists of 
output ports that receive cells through the same output SE, and wherein the output 
sets are further divided into output subsets, 

(c) assign each cell received into the fabric to a flow comprising: 

(i) if a cell is a unicast cell, then associate the cell received into 
the fabric with an input subset and with an output subset based on the 
input port i and the multiple output ports {/'} of the cell, wherein cells 
which are associated with the same input subset and associated with 
the same output subsets are assigned to the same flow, else 

(ii) if a cell is a multicast cell, then the cell is associate with an 
input subset and associated with multiple output subsets based on the 
input port i and the multiple output ports {/} of the cell, wherein cells 
which are associated with the same input subset and associated with 
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the output subsets of the same output sets are assigned to the same 
flow, and 

(d) transmit flows through the internal SE stage wherein cells of a 
particular flow are distributed among the internal switching elements, wherein the 
quantity of the cells of each particular flow transmitted at each internal SE differs by 
at most h, wherein h is positive, 

wherein the number of subsets of at least one input set or at least one output set is less 
than «, wherein n is the number of ports of that input SE or of that output SE, and 
wherein N is the total number of input ports and output ports, and wherein N/ is the 
maximum number of flows whose cells pass any given link, and wherein n, N, N/ 9 h, i, 
j and / are natural numbers. 

21 . The apparatus according to Claim 20 wherein (d) comprises 
a counter module configured to: 

(a) associate a unique counter with a flow, wherein a counter is designated 

as dj; 

(b) initialize the counter with a number less than or equal to /; 

(c) transmit a cell from the flow through the internal switching element 
associated with a numerical identifier which is equal to the numerical value of the 
counter; 

(d) change the numerical value of the counter by decrementing or 
incrementing the counter modulus /, and 

(e) stop if there are no more cells from the flow, otherwise go to step (c), 
wherein the sub-steps are performed for each flow. 

22. A multistage non-blocking switch comprising: 

(a) at least one internal switching element (SE) stage, wherein the stage 
has / internal switching elements, wherein each internal switching element is 
associated with a unique numerical identifier, 

(b) an input SE stage, 

(c) an output SE stage, 
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(d) input ports which are divided into input sets wherein each input set 
consists of input ports that transmit through the same input SE, and wherein the input 
sets are further divided into input subsets, and 

(e) output ports which are divided into output sets wherein each output set 
5 consists of output ports that receive cells through the same output SE, and wherein the 

output sets are further divided into output subsets, and 

(f) a flow assignment module wherein the module assigns cells which are 
received into the fabric to a flow, wherein the assignment comprises 

(i) if a cell is a unicast cell, then the cell is associated with an input 
10 subset and associated with an output subset based on the input port i and the 

output port j of the cell, wherein cells which are associated with the same 
input subset and associated with the same output subset are assigned to the 
same flow, else 

(ii) if a flow is a multicast flow, then each cell is associated with an 
1 5 input subset and associated with multiple output subsets based on the input 

port i and the multiple output ports {/} of the cell, wherein cells which are 
associated with the same input subset and associated with the output subsets of 
the same output sets are assigned to the same flow, 

20 whereby flows are transmitted through the internal SE stage wherein cells of a 

particular flow are distributed among the internal switching elements, wherein the 
quantity of the cells of each particular flow transmitted at each internal SE differs by 
at most h, wherein h is positive, 

25 wherein the number of subsets of at least one input set or at least one output set is less 
than «, wherein n is the number of ports of that input SE or of that output SE, and 
wherein TV is the total number of input ports and output ports, and wherein N/ 9 is the 
maximum number of flows whose cells pass any given link, and wherein n y N, Nf, h, i, 
j and / are natural numbers. 
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23. The fabric according to Claim 22 wherein the assignment module comprises a 
lookup table. 
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24. The fabric according to Claim 22 wherein the assignment module 

(a) associates a unique counter with a flow, wherein a counter is 
designated as c,y, wherein i is the numerical identifier of an associated input subset and 
j is the numerical identifier of an associated output subset; 

(b) initializes the counter with a number less than or equal to /; 

(c) transmits a cell from the flow through the internal switching element 
associated with a numerical identifier which is equal to the numerical value of the 
counter; 

(d) changes the numerical value of the counter by decrementing or 
incrementing the counter modulus /, and 

(e) stops if there are no more cells from the flow, otherwise go to step (c), 
wherein the sub-steps are performed for each flow. 

25. A method for balancing unicast or multicast flows in a multistage non- 
blocking fabric, wherein the fabric comprises at least one internal switching element 
(SE) stage, an input SE stage and an output SE stage, wherein the method comprises: 

(a) receiving cells into the fabric wherein each cell is associated with an 
input subset and associated with an output subset according to the source and 
destination address of the cell, 

(b) assigning each cell to a flow, wherein cells sourced from the same 
input subset, and bound for the same output subset, or multiple output subsets, are 
assigned to the same flow, and 

(c) transmitting flows through the internal SE stage wherein cells of a 
particular flow are distributed among the internal switching elements, wherein the 
cells of each particular flow transmitted at each internal SE differs by at most h, 
wherein h is positive, 

whereby the flow in the fabric is balanced. 
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